IRIX Base Documentation 2002 November

home *** CD-ROM | disk | FTP | other *** search

/ IRIX Base Documentation 2002 November / SGI IRIX Base Documentation 2002 November.iso / usr / share / catman / p_man / cat3 / SCSL / dgemms.z / dgemms

Wrap

Text File | 2002-10-03 | 12.7 KB | 331 lines

DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) NNNNAAAAMMMMEEEE DDDDGGGGEEEEMMMMMMMMSSSS - Multiplies a real general matrix by a real general matrix, using Strassen's algorithm SSSSYYYYNNNNOOOOPPPPSSSSIIIISSSS Fortran: CCCCAAAALLLLLLLL DDDDGGGGEEEEMMMMMMMMSSSS ((((_t_r_a_n_s_a,,,, _t_r_a_n_s_b,,,, _m,,,, _n,,,, _k,,,, _a_l_p_h_a,,,, _a,,,, _l_d_a,,,, _b,,,, _l_d_b,,,, _b_e_t_a,,,, _c,,,, _l_d_c)))) C/C++: ####iiiinnnncccclllluuuuddddeeee <<<<ssssccccssssllll____bbbbllllaaaassss....hhhh>>>> vvvvooooiiiidddd ddddggggeeeemmmmmmmmssss ((((cccchhhhaaaarrrr *_t_r_a_n_s_a,,,, cccchhhhaaaarrrr *_t_r_a_n_s_b,,,, iiiinnnntttt _m,,,, iiiinnnntttt _n,,,, iiiinnnntttt _k,,,, ddddoooouuuubbbblllleeee _a_l_p_h_a,,,, ddddoooouuuubbbblllleeee *_a,,,, iiiinnnntttt _l_d_a,,,, ddddoooouuuubbbblllleeee *_b,,,, iiiinnnntttt _l_d_b,,,, ddddoooouuuubbbblllleeee _b_e_t_a,,,, ddddoooouuuubbbblllleeee *_c,,,, iiiinnnntttt _l_d_c); IIIIMMMMPPPPLLLLEEEEMMMMEEEENNNNTTTTAAAATTTTIIIIOOOONNNN These routines are part of the SCSL Scientific Library and can be loaded using either the ----llllssssccccssss or the ----llllssssccccssss____mmmmpppp option. The ----llllssssccccssss____mmmmpppp option directs the linker to use the multi-processor version of the library. When linking to SCSL with ----llllssssccccssss or ----llllssssccccssss____mmmmpppp, the default integer size is 4 bytes (32 bits). Another version of SCSL is available in which integers are 8 bytes (64 bits). This version allows the user access to larger memory sizes and helps when porting legacy Cray codes. It can be loaded by using the ----llllssssccccssss____iiii8888 option or the ----llllssssccccssss____iiii8888____mmmmpppp option. A program may use only one of the two versions; 4-byte integer and 8-byte integer library calls cannot be mixed. The C and C++ prototypes shown above are appropriate for the 4-byte integer version of SCSL. When using the 8-byte integer version, the variables of type iiiinnnntttt become lllloooonnnngggg lllloooonnnngggg and the <<<<ssssccccssssllll____bbbbllllaaaassss____iiii8888....hhhh>>>> header file should be included. DDDDEEEESSSSCCCCRRRRIIIIPPPPTTTTIIIIOOOONNNN DDDDGGGGEEEEMMMMMMMMSSSS multiplies a double precision general matrix by a double precision general matrix. This routine is an implementation of the Winograd's variation of Strassen's algorithm for matrix multiplication. Because of a very different order of operations performed by the Strassen's algorithm, numerical results from DDDDGGGGEEEEMMMMMMMMSSSS may differ slightly from those of DDDDGGGGEEEEMMMMMMMM. DDDDGGGGEEEEMMMMMMMMSSSS is functionally equivalent to DDDDGGGGEEEEMMMMMMMM, but it does require temporary space which it allocates and manages automatically. This routine performs one of the matrix-matrix operations: _C <- _a_l_p_h_a _o_p(_A) _o_p(_B) + _b_e_t_a _C PPPPaaaaggggeeee 1111 DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) where _o_p(_X) is one of the following: _o_p(_X) = _X _o_p(_X) = _X_T where * _a_l_p_h_a and _b_e_t_a are scalars * _A, _B, and _C are matrices * _o_p(_A) is an _m-by-_k matrix * _o_p(_B) is a _k-by-_n matrix * _C is an _m-by-_n matrix * _XT is the transpose of _X See the NOTES section of this man page for information about the interpretation of the data types described in the following arguments. This routine has the following arguments: _t_r_a_n_s_a Character. (input) Specifies the form of _o_p(_A) to be used in the matrix multiplication, as follows: _t_r_a_n_s_a = 'N' or 'n': _o_p(_A) = _A _t_r_a_n_s_a = 'T' or 't': _o_p(_A) = _A_T _t_r_a_n_s_a = 'C' or 'c': _o_p(_A) = _A_T For C/C++, a pointer to this character is passed. _t_r_a_n_s_b Character. (input) Specifies the form of _o_p(_B) to be used in the matrix multiplication, as follows: _t_r_a_n_s_b = 'N' or 'n': _o_p(_B) = _B _t_r_a_n_s_b = 'T' or 't': _o_p(_B) = _B_T _t_r_a_n_s_b = 'C' or 'c': _o_p(_B) = _B_T For C/C++, a pointer to this character is passed. _m Integer. (input) Specifies the number of rows in matrix _o_p(_A) and in matrix _C. _m must be >= 0. PPPPaaaaggggeeee 2222 DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) _n Integer. (input) Specifies the number of columns in matrix _o_p(_B) and in matrix _C. _n must be >= 0. _k Integer. (input) Specifies the number of columns of matrix _o_p(_A) and the number of rows of matrix _o_p(_B). _k must be >= 0. _a_l_p_h_a Double precision. (input) Scalar factor. _a Double precision array of dimension (_l_d_a,_k_a). (input) When _t_r_a_n_s_a = 'N' or 'n', _k_a is _k; otherwise, it is _m. Contains the matrix _A. Before entry with _t_r_a_n_s_a = 'N' or 'n', the leading _m-by-_k part of array _a must contain matrix _A; otherwise, the leading _k-by-_m part of array _a must contain matrix _A. _l_d_a Integer. (input) Specifies the first dimension of _a as declared in the calling (sub)program. When _t_r_a_n_s_a = 'N' or 'n', _l_d_a >= MMMMAAAAXXXX(1,_m); otherwise, _l_d_a >= MMMMAAAAXXXX(1,_k). _b Double precision array of dimension (_l_d_b,_k_b). (input) When _t_r_a_n_s_b = 'N' or 'n', _k_b is _n; otherwise, it is _k. Contains the matrix _B. Before entry with _t_r_a_n_s_b = 'N' or 'n', the leading _k-by-_n part of array _b must contain matrix _B; otherwise, the leading _n-by-_k part of array _b must contain matrix _B. _l_d_b Integer. (input) Specifies the first dimension of _b as declared in the calling (sub)program. When _t_r_a_n_s_b = 'N' or 'n', _l_d_b >= MMMMAAAAXXXX(1,_k); otherwise, _l_d_b >= MMMMAAAAXXXX(1,_n). _b_e_t_a Double precision. (input) Scalar factor. When _b_e_t_a is supplied as 0, _c need not be set on input. _c Double precision array of dimension (_l_d_c,_n). (input and output) Contains the matrix _C. Before entry, the leading _m-by-_n part of array _c must contain matrix _C, except when _b_e_t_a is 0; in which case, _c need not be set. On exit, the _m-by-_n result matrix overwrites array _c. PPPPaaaaggggeeee 3333 DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) _l_d_c Integer. (input) Specifies the first dimension of _c as declared in the calling (sub)program. _l_d_c >= MMMMAAAAXXXX(1,_m). NNNNOOOOTTTTEEEESSSS This routine is an extension to the Level 3 BLAS. This routine is a modified version of the package developed through the PRISM project for multiplying matrices using Strassen's algorithm. Please see hhhhttttttttpppp::::////////wwwwwwwwwwww....mmmmccccssss....aaaannnnllll....ggggoooovvvv////PPPPrrrroooojjjjeeeeccccttttssss////PPPPRRRRIIIISSSSMMMM for more details. DDDDaaaattttaaaa TTTTyyyyppppeeeessss The following data types are described in this documentation: TTTTeeeerrrrmmmm UUUUsssseeeedddd DDDDaaaattttaaaa ttttyyyyppppeeee Fortran: Array of dimensions (_m,_n) xxxx((((mmmm,,,,nnnn)))) Character CCCCHHHHAAAARRRRAAAACCCCTTTTEEEERRRR Integer IIIINNNNTTTTEEEEGGGGEEEERRRR (IIIINNNNTTTTEEEEGGGGEEEERRRR****8888 for ----llllssssccccssss____iiii8888[[[[____mmmmpppp]]]]) Double precision DDDDOOOOUUUUBBBBLLLLEEEE PPPPRRRREEEECCCCIIIISSSSIIIIOOOONNNN C/C++: Array of dimensions (_m, _n) xxxx[[[[mmmm****nnnn]]]] Character cccchhhhaaaarrrr Integer iiiinnnntttt (lllloooonnnngggg lllloooonnnngggg for ----llllssssccccssss____iiii8888[[[[____mmmmpppp]]]]) Double precision ddddoooouuuubbbblllleeee Note that you can explicitly declare multidimensional C/C++ arrays provided that the array dimensions are swapped with respect to the Fortran declaration (e.g., xxxx[[[[nnnn]]]][[[[mmmm]]]] in C/C++ versus xxxx((((mmmm,,,,nnnn)))) in Fortran). To avoid a compiler type mismatch error in C++ (or a compiler warning message in C), however, the array should be cast to a pointer of the appropriate type when passed as an argument to a SCSL routine. SSSSEEEEEEEE AAAALLLLSSSSOOOO IIIINNNNTTTTRRRROOOO____SSSSCCCCSSSSLLLL(3S), IIIINNNNTTTTRRRROOOO____BBBBLLLLAAAASSSS3333(3S) IIIINNNNTTTTRRRROOOO____CCCCBBBBLLLLAAAASSSS(3S) for information about using the C interface to Fortran 77 Basic Linear Algebra Subprograms (legacy BLAS) set forth by the Basic Linear Algebra Subprograms Technical Forum. PPPPaaaaggggeeee 4444 DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) DDDDGGGGEEEEMMMMMMMMSSSS((((3333SSSS)))) DDDDGGGGEEEEMMMMMMMM(3S), SSSSGGGGEEEEMMMMMMMM(3S) to multiply general matrices by using the more standard _i_n_n_e_r _p_r_o_d_u_c_t algorithm PPPPaaaaggggeeee 5555